7 research outputs found
The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations
The Parallel Meaning Bank is a corpus of translations annotated with shared,
formal meaning representations comprising over 11 million words divided over
four languages (English, German, Italian, and Dutch). Our approach is based on
cross-lingual projection: automatically produced (and manually corrected)
semantic annotations for English sentences are mapped onto their word-aligned
translations, assuming that the translations are meaning-preserving. The
semantic annotation consists of five main steps: (i) segmentation of the text
in sentences and lexical items; (ii) syntactic parsing with Combinatory
Categorial Grammar; (iii) universal semantic tagging; (iv) symbolization; and
(v) compositional semantic analysis based on Discourse Representation Theory.
These steps are performed using statistical models trained in a semi-supervised
manner. The employed annotation models are all language-neutral. Our first
results are promising.Comment: To appear at EACL 201
The Annotators Did Not Agree on Some of the Guidelines Examples
International audienceWe propose the annotation of 7 sentences out of the 31 provided in the ISA-17 shared task, according to our understanding of the guidelines. We include here several remarks to improve the annotation and provide some tools to make the task easier
Quantification Annotation in ISO 24617-12, Second Draft
International audienceThis paper describes the continuation of a project that aims at establishing an interoperable annotation scheme for quantification phenomena as part of the ISO suite of standards for semantic annotation, known as the Semantic Annotation Framework. After a break, caused by the Covid-19 pandemic, the project was relaunched in early 2022 with a second working draft, which deals with certain issues in the annotation of quantification in a more satisfactory way than the original first working draft
Multityped Abstract Categorial Grammars and Their Composition
International audienceThis article introduces multityped abstract categorial grammars and show that a suitable composition operation can be defined
Quantification Annotation in ISO 24617-12, Second Draft
International audienceThis paper describes the continuation of a project that aims at establishing an interoperable annotation scheme for quantification phenomena as part of the ISO suite of standards for semantic annotation, known as the Semantic Annotation Framework. After a break, caused by the Covid-19 pandemic, the project was relaunched in early 2022 with a second working draft, which deals with certain issues in the annotation of quantification in a more satisfactory way than the original first working draft